
54 research outputs found

Characterizing non-heroin opioid overdoses using electronic health records.

Author: Averitt Amelia J
Perotte Adler J
Slovis B. H.
Tariq Abdul A
Vawdrey David K
Publication venue: Jefferson Digital Commons
Publication date: 26/11/2019

Introduction: The opioid epidemic is a modern public health emergency. Common interventions to alleviate the opioid epidemic aim to discourage excessive prescription of opioids. However, these methods often take place over large municipal areas (state-level) and may fail to address the diversity that exists within each opioid case (individual-level). An intervention to combat the opioid epidemic that takes place at the individual level would be preferable. Methods: This research leverages computational tools and methods to characterize the opioid epidemic at the individual level using the electronic health record data from a large, academic medical center. To better understand the characteristics of patients with opioid use disorder (OUD), we leveraged a self-controlled analysis to compare the healthcare encounters before and after an individual's first overdose event recorded within the data. We further contrast these patients with matched, non-OUD controls to demonstrate the unique qualities of the OUD cohort. Results: Our research confirms that the rate of opioid overdoses in our hospital significantly increased between 2006 and 2015 (P < 0.001), at an average rate of 9% per year. We further found that the period just prior to the first overdose is marked by conditions of pain or malignancy, which may suggest that overdose stems from pharmaceutical opioids prescribed for these conditions. Conclusions: Informatics-based methodologies, like those presented here, may play a role in better understanding those individuals who suffer from opioid dependency and overdose, and may lead to future research and interventions that could successfully prevent morbidity and mortality associated with this epidemic.

Jefferson Digital Commons

Preserving Differential Privacy in Convolutional Deep Belief Networks

Author: A Armato
A Bandura
A Gottlieb
A Ortiz
A Perotte
A Perotte
C Dwork
Dejing Dou
E Choi
EW Jamoom
G Arfken
G Hinton
GE Hinton
H Li
HY Xiong
J Ma
J Wu
J Zhang
M Hay
M Helmstaedter
M Roumia
M Vlcek
MKK Leung
N Phan
N Phan
NhatHai Phan
R Fang
R Miotto
S Bach
S Hochreiter
SM Plis
T Lee
TJ Rivlin
W Rudin
Xintao Wu
Y Bengio
Y LeCun
Y Lecun
Publication date: 01/01/2017

The remarkable development of deep learning in the medicine and healthcare domain presents obvious privacy issues when deep neural networks are built on users' personal and highly sensitive data, e.g., clinical records, user profiles, biomedical images, etc. However, only a few scientific studies on preserving privacy in deep learning have been conducted. In this paper, we focus on developing a private convolutional deep belief network (pCDBN), which essentially is a convolutional deep belief network (CDBN) under differential privacy. Our main idea of enforcing epsilon-differential privacy is to leverage the functional mechanism to perturb the energy-based objective functions of traditional CDBNs, rather than their results. One key contribution of this work is that we propose the use of Chebyshev expansion to derive the approximate polynomial representation of objective functions. Our theoretical analysis shows that we can further derive the sensitivity and error bounds of the approximate polynomial representation. As a result, preserving differential privacy in CDBNs is feasible. We applied our model in a health social network, i.e., YesiWell data, and in a handwriting digit dataset, i.e., MNIST data, for human behavior prediction, human behavior classification, and handwriting digit recognition tasks. Theoretical analysis and rigorous experimental evaluations show that the pCDBN is highly effective. It significantly outperforms existing solutions.

arXiv.org e-Print Archive

ScholarWorks@UARK

Crossref

UARK (University of Arkansas)

Who are the Users of Speed Regulation Assistance? Comparing Driver Characteristics of Casual and Intensive System Users

Author: A. Perotte
D. J. Albers
E. Tabak
George Hripcsak
Noémie Elhadad
Publication venue: Iowa Research Online
Publication date: 18/06/2013

Speed regulation assistance can contribute to road safety provided that drivers use the systems on a regular basis. With the objective to gain knowledge about drivers who use Cruise Control and the Speed Limiter, a comparison of the characteristics of casual and intensive users was performed with survey data. The results show that gender and annual mileage play a role for the usage frequency of Cruise Control, whereas the usage frequency of the Speed Limiter depends on age. Consistent effects of car use for business matters and the use of other in-vehicle technologies were found on the usage frequency of both systems. The predominant motive to reduce speeding found for both systems corresponds with the objective of speed regulation assistance as a safety measure. It was complemented with a comfort benefit perceived by Cruise Control users.

Iowa Research Online

FigShare

Predicting Multiple ICD-10 Codes from Brazilian-Portuguese Clinical Notes

Author: A Perotte
AEW Johnson
F Duarte
G Salton
J Huang
M Li
M Oleynik
M Subotin
P Bojanowski
PB Jensen
SVS Pakhomov
Publication venue: Springer Science and Business Media LLC
Publication date: 29/07/2020

ICD coding from electronic clinical records is a manual, time-consuming and expensive process. Code assignment is, however, an important task for billing purposes and database organization. While many works have studied the problem of automated ICD coding from free text using machine learning techniques, most use records in the English language, especially from the MIMIC-III public dataset. This work presents results for a dataset with Brazilian Portuguese clinical notes. We develop and optimize a Logistic Regression model, a Convolutional Neural Network (CNN), a Gated Recurrent Unit Neural Network and a CNN with Attention (CNN-Att) for prediction of diagnosis ICD codes. We also report our results for the MIMIC-III dataset, which outperform previous work among models of the same families, as well as the state of the art. Compared to MIMIC-III, the Brazilian Portuguese dataset contains far fewer words per document when only discharge summaries are used. We experiment with concatenating additional documents available in this dataset, achieving a great boost in performance. The CNN-Att model achieves the best results on both datasets, with micro-averaged F1 score of 0.537 on MIMIC-III and 0.485 on our dataset with additional documents. Comment: Accepted at BRACIS 202

arXiv.org e-Print Archive

Crossref
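
The abstract above reports micro-averaged F1 for multi-label ICD code prediction. As a rough illustration of how that metric is computed, the sketch below pools true positives, false positives and false negatives across all documents before taking precision and recall. The function name `micro_f1` and the toy code sets are assumptions made for this example; they are not taken from the paper.

```python
def micro_f1(gold, predicted):
    """Micro-averaged F1 for multi-label predictions.

    gold, predicted: lists of sets of labels, one set per document.
    Counts are pooled over all documents before computing the score.
    """
    tp = fp = fn = 0
    for g, p in zip(gold, predicted):
        tp += len(g & p)   # labels predicted and correct
        fp += len(p - g)   # labels predicted but not in the gold set
        fn += len(g - p)   # gold labels that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy example with made-up code assignments (hypothetical, not study data).
gold = [{"I10", "E11"}, {"J18"}, {"N18", "I10"}]
predicted = [{"I10"}, {"J18", "E11"}, {"N18"}]
score = micro_f1(gold, predicted)
print(round(score, 3))
```

Because counts are pooled, frequent codes weigh more heavily in this score than rare ones, which is worth keeping in mind when comparing datasets with different code distributions.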

A hierarchical method to automatically encode Chinese diagnoses through semantic similarity estimation

Author: A Aronson
A Perotte
A Rios
B McInnes
B Ribeiro-Neto
C Friedman
D Sánchez
H Li
H Schutze
H Wang
H Zhang
J Hornberger
K Lund
K O’Malley
L Chen
L Dai
L Larkey
L Lita
M Stanfill
Ming Yu
P Resnik
Q Liu
Q Liu
R Farkas
R Kavuluru
R Kukafka
R Mihalcea
R Rada
Runtong Zhang
S Boytcheva
S Meystre
S Pakhomov
S Patwardhan
S Pereira
T Cohen
T Landauer
T Pedersen
Wenxin Ning
X Cheng
Y Yang
Y Zhang
Z Harris
Z Wu
Publication venue: Springer Science and Business Media LLC

Crossref

Temporal Properties of Diagnosis Code Time Series in Aggregate

Author: A. Perotte
G. Hripcsak
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2013

aerial view, housing, 8/21/201

Crossref

MIT Libraries Dome

Risk prediction for chronic kidney disease progression using heterogeneous electronic health record data and time series analysis

Author: Blei D.
Elhadad N.
Hirsch J. S.
Perotte A.
Ranganath R.
Publication venue: Donald and Barbara Zucker School of Medicine Academic Works
Publication date: 01/01/2015

BACKGROUND: As adoption of electronic health records continues to increase, there is an opportunity to incorporate clinical documentation as well as laboratory values and demographics into risk prediction modeling. OBJECTIVE: The authors develop a risk prediction model for chronic kidney disease (CKD) progression from stage III to stage IV that includes longitudinal data and features drawn from clinical documentation. METHODS: The study cohort consisted of 2908 primary-care clinic patients who had at least three visits prior to January 1, 2013 and developed CKD stage III during their documented history. Development and validation cohorts were randomly selected from this cohort and the study datasets included longitudinal inpatient and outpatient data from these populations. Time series analysis (Kalman filter) and survival analysis (Cox proportional hazards) were combined to produce a range of risk models. These models were evaluated using concordance, a discriminatory statistic. RESULTS: A risk model incorporating longitudinal data on clinical documentation and laboratory test results (concordance 0.849) predicts progression from stage III CKD to stage IV CKD more accurately when compared to a similar model without laboratory test results (concordance 0.733,

PubMed Central

Hofstra Northwell Academic Works (Hofstra Northwell School of Medicine)
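
The chronic kidney disease abstract above evaluates its risk models with concordance. The abstract does not say which variant was used; the sketch below assumes Harrell's concordance index for right-censored survival data, where a pair of patients is comparable if the one with the shorter follow-up time had an observed event. The function name `concordance_index` and the toy data are hypothetical and only serve to illustrate the idea.

```python
def concordance_index(times, events, risks):
    """Harrell's C-index (assumed variant; tied times are skipped).

    times:  observed follow-up time per patient
    events: 1 if progression was observed, 0 if censored
    risks:  model risk score (higher means earlier expected event)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # Pair is comparable when i had an event strictly before j's time.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0   # model ranks the pair correctly
                elif risks[i] == risks[j]:
                    concordant += 0.5   # tied risk scores count as half
    return concordant / comparable


# Toy data (hypothetical): four patients, the third one censored.
times = [2, 4, 6, 8]
events = [1, 1, 0, 1]
risks = [0.9, 0.4, 0.5, 0.1]
c_index = concordance_index(times, events, risks)
print(c_index)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which gives a scale for reading the concordance values quoted in the abstract.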

Dynamical phenotyping: using temporal analysis of clinically collected physiologic data to stratify populations.

Author: A Perotte
D J Albers
E Tabak
George Hripcsak
Noémie Elhadad
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2014

Using glucose time series data from a well-measured population drawn from an electronic health record (EHR) repository, the variation in predictability of glucose values quantified by the time-delayed mutual information (TDMI) was explained using a mechanistic endocrine model and manual and automated review of written patient records. The results suggest that predictability of glucose varies with health state where the relationship (e.g., linear or inverse) depends on the source of the acuity. It was found that on a fine scale in parameter variation, the less insulin required to process glucose, a condition that correlates with good health, the more predictable glucose values were. Nevertheless, the most powerful effect on predictability in the EHR subpopulation was the presence or absence of variation in health state, specifically, in- and out-of-control glucose versus in-control glucose. Both of these results are clinically and scientifically relevant because the magnitude of glucose is the most commonly used indicator of health as opposed to glucose dynamics, thus providing for a connection between a mechanistic endocrine model and direct insight into human health via clinically collected data.

CiteSeerX

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

A Bayesian Analysis of Dynamics in Free Recall

Author: Adler J. Perotte
David M. Blei
Kenneth A. Norman
Per B. Sederberg
Richard Socher
Samuel J. Gershman
Publication date: 01/01/2009

We develop a probabilistic model of human memory performance in free recall experiments. In these experiments, a subject first studies a list of words and then tries to recall them. To model these data, we draw on both previous psychological research and statistical topic models of text documents. We assume that memories are formed by assimilating the semantic meaning of studied words (represented as a distribution over topics) into a slowly changing latent context (represented in the same space). During recall, this context is reinstated and used as a cue for retrieving studied words. By conceptualizing memory retrieval as a dynamic latent variable model, we are able to use Bayesian inference to represent uncertainty and reason about the cognitive processes underlying memory. We present a particle filter algorithm for performing approximate posterior inference, and evaluate our model on the prediction of recalled words in experimental data. By specifying the model hierarchically, we are also able to capture inter-subject variability.

CiteSeerX